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ABSTRACT 


Effective feedback can reduce building power consumption and carbon emissions. Therefore, providing 
information to building managers and tenants is the first step in identifying ways to reduce power 
consumption. Since reducing anomalous consumption can have a large impact, this study proposes a 
novel approach to using large sets of data for a building space to identify anomalous power consumption. 
This method identifies anomalies in two stages: consumption prediction and anomaly detection. 
Daily real-time consumption is predicted by using a hybrid neural net ARIMA (auto-regressive integrated 
moving average) model of daily consumption. Anomalies are then identified by differences between 
real and predicted consumption by applying the two-sigma rule. The experimental results for a 


Big data analytics 


. 17-week study of electricity consumption in a building office space confirm that the method can detect 
Anomaly detection 


Pattern recognition anomalous values in real time. Another contribution of the study is the development of a formalized 
Real time detection methodology for detecting anomalous patterns in large data sets for real-time of building office space 
Time series prediction energy consumption. Moreover, the prediction component can be used to plan electricity usage while 
the anomaly detection component can be used to understand the energy consumption behaviors of 

tenants. 
© 2014 Elsevier Ltd. All rights reserved. 
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1. Introduction 


* Corresponding author. Tel.: +886 2 2737 6321; fax: +886 2 2737 6606. . ; . 
E-mail addresses: jschou@mail.ntust.edu.tw, The OECD has predicted that global energy consumption will 


rayson.chou@gmail.com (J.-S. Chou), D9915802@mail.ntust.edu.tw (A.S. Telaga). increase 53% from 505 quadrillion Btu in 2008 to 770 quadrillion 


http://dx.doi.org/10.1016/j.rser.2014.01.088 
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Btu in 2035 [1]. Compared to the transportation and industrial 
sectors, the building sector consumes more energy (approximately 
40% of global energy use) and generates 30% more CO» [2]. 
Therefore, a critical step in lowering carbon is reducing energy 
consumption in buildings. Given the particularly high dependence 
of Taiwan on imported fossil fuels, developing an economical, low- 
carbon, and highly efficient green energy system is imperative [3]. 

Studies performed in the United Kingdom [4] and in the United 
States [5] show that the growing use of energy-consuming equip- 
ment beat efficiency gains in green building technology. The 
increased energy consumption is mainly due to equipment for 
maintaining comfort in residential and commercial buildings, such 
as air conditioners, heaters and other modern appliances [5]. 
However, energy consumption in commercial buildings is more 
complex than that in residential buildings [6]. 

While residential buildings mainly provide a sanctuary for 
people, commercial buildings have widely varying purposes. 
Nevertheless, commercial buildings are mainly designed for busi- 
ness activities and expected to generate income for building 
owners and their tenants. Therefore, energy-saving strategies are 
needed to reduce operating costs on both sides. Specifically, 
electricity consumption by commercial buildings is the highest 
during 9:00-17:00, which is usually the highest price in time-price 
based schemes. Moreover, Popescu et al. also found that energy- 
efficient buildings benefit owners by increasing the property 
values [7]. 

The building manager is responsible for managing building 
performance, and one of the main building performance measures 
is electricity consumption [8]. Additionally, in countries that have 
recently increased requirements for green building certification, 
the building manager must minimize energy consumption. Thus, 
to reduce electricity consumption and CO, emissions, building 
managers must understand energy consumption from the tenant 
perspective. Therefore, building electricity consumption is both a 
social problem and a technical problem [5]. 

Analyzing electricity consumption from the tenant perspective 
requires very detailed data. To acquire such data, researchers have 
proposed using sensors for detecting movement [9], thermostats 
[10], cameras [11] or combinations of sensors that detect light, CO3, 
temperature, etc. [12]. In practice, however, implementing this 
approach in commercial buildings is highly impractical. For privacy 
reasons, some tenants may reject the idea of sensors installed in their 
offices. Moreover, wiring costs are 45% and 75% of total installation 
cost for new buildings and retrofitted buildings, respectively [13]. 
Analyzing data streams from numerous real-time sensors can also be 
a heavy burden on building energy managers [14]. 

Smart meter use can reduce the required number of sensors and 
eventually reduces data stream volume. A smart meter is an electrical 
meter that records electrical energy consumption at intervals of an 
hour or less and sends the information back to the utility center for 
monitoring and billing purposes [15]. Therefore, smart meters 
provide more information compared to conventional meters, which 
only provide data for billing purposes [15]. Moreover, a smart meter 
management system is needed for an efficient smart grid system 
[16]. Finally, customers benefit from improved reliability of utility 
networks [17] and improved responsiveness of services, which 
eventually improve and sustain the customer relationship [18]. 

Additionally, smart meter data can be utilized to provide power 
quality (PQ) information to customers and utility companies. As the 
quality is susceptible to any disturbance in power transmission 
network, PQ is an important measure for customers | 19]. Particularly, 
for the buildings that use electricity from different companies, the 
companies could develop PQ index [20] to provide fair information to 
customer and use the index to monitor any disturbance in power 
quality production. Consequently, for a fairer energy price, the price 
can be adjusted in terms of power quality [21]. 


Smart meters can provide detailed data for the electricity 
consumption of a customer in real-time or near real-time. Further, 
in-home implementations combining smart meter and enabling 
technologies such as in-home display have shown that smart 
meters can reduce energy consumption [22]. Studies show that 
the highest reductions occur when people are already at home at 
17:00 (5 pm), which indicates that, with the right feedback, people 
can reduce their electricity consumption [23]. For example, a study 
by the Energy Saving Trust in 2009 showed that feedbacks that 
had the largest contribution to smart meter use were those that 
helped to reduce electricity use [24]. 

Anomalous electricity consumption data can help tenants identify 
extraordinary consumption patterns [25]. In commercial buildings, 
anomalous consumption may also result from activities such as over- 
lighting [6], inefficient equipment or overtime work. Therefore, 
anomalous feedback data can be further used to warn tenants to 
minimize electricity use and to help them identify ineffective equip- 
ment or over-lighting in office spaces. However, extracting meaningful 
information from smart meter data is a formidable task [26]. 

Although several anomaly detection methods have been 
researched, the primary objective has been detecting anomalous 
consumption in automated building systems such as heating, 
ventilation, and air conditioning (HVAC) systems [14,26-28]. 
However, the building must also support random use of office 
equipment, lightings, heating, and air condition. Since HVAC 
systems consume almost 50% of energy in a building [8], reduction 
of energy use by non-HVAC systems can potentially reduce total 
consumption by 50%. Office equipment consumes 15% of the total 
energy consumed by an office. By 2020, this figure is expected to 
increase twofold [29]. Therefore, potential savings in electricity 
consumption by office spaces are also large. 

Because no studies have considered anomaly detection in office 
spaces, this study performed an experiment to develop a real-time 
system for detecting anomalous electricity consumption in an 
office space from the perspective of occupant activities. All 
experimental data were retrieved from smart meters used to 
monitor electricity consumption in an office space in a university 
building. The main objective was to develop an anomaly detection 
methodology that is applicable in large data stream of smart meter 
data and real time environment. Therefore, the research results 
have potential applications in a web-based early warning system. 
Notably, the results application is not only limited to building 
energy consumption domain, but also applicable to any anomaly 
detection system that use time based sensor data as input. 
Furthermore, the potential application includes gas flow detection, 
water flow detection, and comfort level detection. The main 
contributions of this research are the following: 


e A formalized methodology for detecting anomalous patterns in 
large real-time datasets for building office space energy 
consumption. 

© The method is performed in two stages. The prediction stage 
helps building managers plan their electricity demand while 
the anomaly detection stage helps building managers identify 
tenant consumption patterns. In the case of a building that 
generates its own electricity and has abnormally low energy 
consumption, the building manager can connect to a smart grid 
and sell the excess electricity to gain profit. 

e Anomaly detection benefits tenants by helping them under- 
stand how their office activities consume energy. They can then 
modify their anomalous activities, analyze energy consump- 
tion costs and benefits, and eventually reduce their wasteful 
activities. 


The remainder of this paper is organized as follows. Section 2 
briefly introduces the study context by reviewing related 
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literature, including studies of anomaly detection and some well- 
known demand prediction techniques. Section 3 then describes 
the research methodology, and the evaluation methods of pro- 
posed models. Section 4 further explains the experiment per- 
formed in this research. Section 5 then presents the experimental 
results, and Section 6 analyzes the results and compares model 
performance. Finally, Section 7 summarizes the findings and 
conclusions. 


2. Review of literature on research problem 


Analyzing building electricity consumption data is important 
because failure to do so can jeopardize building management by 
causing excessive energy use and potential increases in carbon 
taxes [30]. Although building energy consumption has been 
studied intensively, further studies are needed to optimize artifi- 
cial intelligence (AI) for prediction models and to integrate the 
models in a Building Energy Management System (BEMS) [31]. 
In the future, most smart meter systems will be Al-based to enable 
independent management of power consumption [32]. 

Electricity consumption in an office space is usually estimated 
for a typical working day and working week. However, electricity 
demand signatures differ according to occupant behavior and 
during different periods, e.g., lunch time, regular workdays, and 
seasonal holidays [26]. Therefore, an anomaly detection system for 
an office space must detect anomalous consumption based on 
these signatures and pattern changes during seasonal holidays. 

Here, an anomalous condition is defined as an abnormal power 
consumption usage. An anomalous state is defined as a deviation 
from the normal electricity consumption of the tenant. Therefore, 
the proposed anomaly detection model has two stages: power 
consumption prediction method and anomaly detection. Because 
electricity consumption data is a time series domain, the objective 
was to develop a suitable time series anomaly detection method. 
By defining an anomalous state as two standard deviations (SDs) 
above or below the predicted power consumption, an anomaly can 
be easily computed and flagged. The definition is based on 
empirical rule of normal distribution that 95% of values lie within 
roughly two SDs of the mean. Therefore, another 5% value outside 
two SDs can be considered as an anomaly as depicted in Fig. 1. 
Similar definition was used in Brown et al. that they defined 
anomalous activity as consumption exceeding an SD of three in the 
prediction results [33]. 

Studies of anomaly detection methods in the energy consump- 
tion domain include Yi et al., who compared regression, entropy, 
and clustering methods. The regression methods obtained the best 
detection results [34]. Brown et al. further used kK-nearest 


Normal Distribution 


Fig. 1. 2-Sigma rule of normal distribution. 


neighborhood (K-NN) in fast kernel regression to predict electri- 
city consumption [33]. However, both methods require huge 
datasets, and the resulting models are static. Since they are prone 
to pattern change, they are not the preferred models for on-line 
prediction [35]. 

When using large data sets to solve problems and identify 
pattern changes, researchers have combined sliding window 
datasets with other techniques such as adaptive artificial neural 
network (ANN) [36]. Even when the dataset is small, adaptive ANN 
outperforms static ANN when using a real measurement dataset 
and performs comparably to static ANN when using a static 
dataset. Wrinch et al. applied Fourier transform with a sliding 
window and found that a weekly window provided faster fault 
detection compared to a monthly window [26]. Li et al. performed 
a time-series auto-regressive integrated moving average (ARIMA) 
analysis of a dataset for real-world daily shifts in water consump- 
tion to detect meter stilting [27]. 

However, anomaly detection by Fourier transformation has a 
high false positive rate due to the assumption of constant 
periodicity of data [35]. The ARIMA method does not obtain a 
good model if the duration of anomaly data is long [37]. Optimiz- 
ing the hidden layer and time lag is also problematic when 
applying ANN in time series domain [38]. Therefore, a suitable 
method is needed to address these limitations. 

Recent studies suggest that a combination of several individual 
models can compensate for deficiencies of a model. Theoretically, 
hybridization of unrelated models also reduces generalization 
variance or error [39]. Since forecasting problems in real-world 
time series data usually contain both linear and nonlinear compo- 
nents [38], the hybrid model usually combines linear and non- 
linear models. In the energy demand prediction domain, ARIMA 
and ANN are the most common forecasting methods [40,41]. 
Individual ARIMA models have been widely used for linear time 
series forecasting [39,41], and ANN has been successfully used to 
solve nonlinear problems [42]. 

Zhang confirmed that a hybrid model combining ARIMA and 
ANN is better than either of the models used independently [38]. 
Hybrid models have been successfully applied in economic time 
series forecasting [43], fuel wood price [44], wind speed forecast- 
ing [45], and electricity price [46]. Although there is still no 
consensus of best approach in combining ANN and ARIMA [42], 
hybridization of ARIMA and ANN or other soft computing has been 
proved to improve energy demand forecasting [40]. Until now, 
however, these methods have only been applied to small data 
streams. Their effectiveness in large data streams for electricity 
consumption is still unknown and deserved further investigation. 
In this sense, this study proposes a hybrid ANN and ARIMA model 
with a sliding window for analyzing large data streams in the 
power consumption domain. 


3. Methodology 


The experimental methodology was based on a case study. 
First, a K-means algorithm, a data mining cluster analysis [47], was 
used to categorize daily consumption patterns in a week. The 
analysis of K-means algorithm results suggested that electricity 
consumption patterns differ each day except on weekends. 
A correlation analysis was then performed to determine whether 
training data are better suited for a weekly or daily analysis. The 
results were consistent with our observation that training data 
were best presented by weekly window since the office occupants 
were students who attended class on a weekly basis. 

After arrangement of the consumption data, the data were 
entered into an ANN-ARIMA hybrid model using NNAR (Neural 
Network Auto Regressive), a function in forecast library of R 
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Fig. 2. Anomaly detection flowchart. 


statistical software. The R statistical software is a comprehensive 
software applicable to handle data manipulation, calculation and 
graphical visualization [48]. Sliding windows for 4-week and 
8-week datasets were used for training sets in the NNAR model 
to form dynamic models. The resulted models then were stored in 
database as a predicted consumption for next week electricity 
consumption. Fig. 2 shows the flow chart of anomaly detection 
process. 

The anomalous states computed by calculating differences 
between actual consumption and predicted consumption were 
then flagged when time duration exceeded anomalous time 
allowed. The prediction results were then evaluated using mean 
absolute percentage error (MAPE), mean absolute error (MAE), and 
root mean square error (RMSE) for accuracy evaluation purpose. 
Therefore, three evaluation methods were utilized to represent 
deviation between actual electricity consumption and predicted 
consumption. 


3.1. K-means algorithm 


The K-means algorithm is one of the simplest unsupervised 
learning algorithms for solving clustering problems [47]. This 
algorithm is also one of the most popular and widely used 
partition clustering methods. The procedure is simple; firstly the 
algorithm classifies a set through a definite number of clusters. 
After finding the cluster centers, the algorithm positions the 
centers as remotely as possible. Finally, the algorithm affiliates 


all data points from the dataset with the closest centers. The first 
iteration is complete when no data points remain. The iterations 
continue until all centers are determined. 

The K-means algorithm searchers for the cluster centers 
(C1, C2, ...,Cķ) such that the sum of the squared distances (called 
distortion) of each data point (x;) to its nearest cluster center (cx) is 
minimized (Eq. (1)), where d is the distance function of the 
Euclidean distance [47]: 


D= Š [min dico” k=1,2,...,K (1) 


t=] 


© Step 1: Assign cluster number K and initialize the centroids of 


each cluster, (c®,c®,...,c®); each cluster center is an m- 
dimensional vector 1, 2, ..., c? = {c\?’, c®, ..., c9). 


© Step 2: Start the iterative procedure. Set iteration count t to 1. 
e Step 3: Calculate the distance measure d\,~') between the kth 
cluster center and the ith data set (data point in m space). Here, 
the distance is defined as the Euclidean distance given by the 


following equation: 


(t—1) f= 
dyi = ||Xi — Ck I| = 


(2) 


© Step 4: Assign each data object x; to its nearest cluster center Cp, 
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© Step 5: Update each cluster center c® as the mean of all x; that 
are closest according to the following equation: 


t Dx; eX ` 

k7 Nk (3) 
where nx is number of data items in the kth cluster. 

© Step 6: Use Eq. (3) to calculate distortion D, which depicts the 
sum of all intra cluster distances. A low D is preferable. 

© Step 7: If the value of D has converged, return the final cluster 
centers (c\”, c9, ...,c\). Otherwise, set t=t+1, and return to 
step 3. 


3.2. Artificial neural networks 


Artificial neural networks (ANNs) are information- 
processing units that function similarly to neurons in the 
human brain except that a neural network consists of artificial 
neurons [49]. The structure of an ANN contains many such 
neurons connected systematically. The feed-forward neural 
networks used here are also known as multilayer perceptrons 
(MP). The quantifiable data used for problem solving are fed 
into the input layer and then processed by the self-updating 
and self-learning model in the hidden layer. The resulting 
solution is then sent to the output layer. The mathematical 
model for MP is: 


n 
a=o(b+ > ws) (4) 
i=1 
where 


z is the forecast value; 

@ is the activation function; 
w is the vector of weights; 
b is the bias; and 

n is the number of neurons. 


In the hidden layer, the activation function is often selected as 
the logistic sigmoid function. 
s(Z) = 


Ite a 


3.3. Auto-regressive integrated moving average 


The ARIMA (auto-regressive integrated moving average) is a 
time series forecasting model [38,50] that uses time-series sta- 
tionary data. Therefore, the data must be made stationary by 
differencing d times. Auto-regression is a forecasting equation 
term that explains lags of the time series. Furthermore, lag forecast 
errors are explained by a moving average term in the forecasting 
equation. Lastly, integration explains the addition of those two 
series. Eq. (6) below depicts the non-seasonal ARIMA model as 
“ARIMA(p,d,q)”: 


p q 

Tt=Po+ È PMe-ita—-— DY Oiti (6) 
i=l i= 

where 


p is the number of autoregressive terms; 

d is the number of non-seasonal differences; 

q is the number of lagged forecast errors in the prediction 
equation; 

¢ is the autoregressive constant; 

8 is the moving average constant; 


t is the number of time series data items; 
r is the forecast value; and 
a is the moving average value. 


3.4, Neural network auto regressive 


The NNAR forecasting model is a hybrid ANN-ARIMA model in 
which the neural network uses lagged values of the time series as 
inputs. Since the model uses one hidden layer feed-forward 
network in which the inputs are lags 1 to p, the model uses p last 
observations [50]. In the ANN modeling stage, the model starts 
with a random weight and then applies the adjusted weight when 
performing the forecasting computation. The network is trained 
for one-step forecasting and uses a recursive calculation for multi- 
step forecasting. Therefore, the mathematical formula for NNAR 
becomes: 


k p 
Ye=WwWo+ È b-e(was D Won) He (7) 
j=l i=1 


where 


j=1,2,...,k is the number of neurons; 
i=1,2,...,p is the lag; 

Wo, is a constant; 

wj is the connection weight where j = 1, 2,...,k; 
g is the activation function in Eq. (5); 

Woj is a constant at neuron j; 
wy is the connection 
1= 1,2562.;D, J = 152, «03k; and 
€t is an error term. 


weight where 


In practice, the ANN performs a nonlinear model of the last p 
observations with k neurons where 


Ve =fYt-1s ---Yt-p W) + Et (8) 


Yı is the predicting value; 

p is the lag number; 

w is the weight for all parameter; and 
€t is the error term. 


The resulting NNAR(p,k) model resembles a model that uses p 
lagged inputs and k nodes in hidden layer. For example NNAR(8,9) 
indicates a neural network that uses the previous eight values 
(Yt-1-Yt-2» ----Yt—g) as inputs for the neural network and uses nine 
neurons in hidden layer. A NNAR(p,0) model resembles an ARIMA 
(p,0,0) model but does not have the parameter restrictions used to 
ensure stationarity. 


3.5. Evaluation method 


To evaluate the accuracy of electricity demand, several criter- 
ions are used. There are different alternative methods for this 
purpose: MAPE, MAE and RMSE are defined as follows: 


x 100 (9) 


2i i-YiP (10) 
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-Yil 


MAE = | 
n; 1 


1 


I M= 


(11) 
where 


Y, predicted value; 
Y;, observed values; and 
n, sample size. 


The MAPE is useful for evaluating the performance of predictive 
models because of its relative values. Since the MAPE is unaffected 
by the size or the unit of the observed and predicted values, it 
indicates their relative difference. A MAPE value lower than 10% 
indicates high forecasting accuracy. Values of 10-20%, 20-50%, and 
over 50% indicate good, reasonable, and inaccurate forecasting 
accuracy, respectively. 

The RMSE formula calculates the square error of the prediction 
compared to observed values and calculates the square root of the 
summation value. Since the errors are squared before calculating 
the average, values with large errors are weighted most heavily. 
Therefore, this measure is effectively reveals unacceptably large 
differences. In contrast, MAE by definition calculates the average 
magnitude of errors between predicted and observed values 
without considering the direction of errors. That is, since all 
individual differences are weighted equally, MAE can measure 
continuous variables. 


405 


4. Experimental design 


An experiment was designed to measure actual power consump- 
tion data. Although the experiment was performed in a laboratory 
located in a university office building, the experiment was performed 
in a real-life setting. The office occupants performed daily activities 
without being given suggestions for improving energy efficiency. 
Therefore, the power consumption data reflected the real-life activ- 
ities of office occupants. Variations in power consumption resulted 
from people activities inside the office. The commercially available 
smart meters used for the experiment were connected to the server 
operated by the manufacturer. 

Fig. 3 shows the steps of the anomaly detection process. Power 
consumption data in this experiment were obtained only from smart 
meter 1, which was used to measure electricity consumption in an 
office space without centralized HVAC. The data were then trans- 
ferred to the application server for further processing. The 
application-server then stored the data and automatically uploaded 
the data to a database. The database preparation stage restored 
missing values by interpolating two adjacent data items and then 
formatting them for further modeling. 

The forecast package in R language was used to predict electricity 
consumption. Finally, after saving the prediction results in the 
database, the anomaly detection procedure was performed. The 
database then received the real-time power consumption data from 
the smart meter and calculated the difference by two-sigma rule to 
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Fig. 3. Stages of anomaly detection process. 
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detect anomalous electricity consumption behavior. The current and 
predicted energy consumption data could then be accessed by users. 

Fig. 4 shows how the smart meters used in the experiment were 
installed in an office space. Smart meter 1 recorded the electricity 
consumed by the computers, printers, scanners, ventilation fans, 
projectors, phones, servers, NAS servers, electricity plugs, ceiling fans, 
hub, and lamps inside the office space. In the office space used for the 
experiment, 28 out of 30 main electricity sockets were in use, and 50 
out of 55 additional electricity sockets were in use. The room also 
had 20 LAN sockets. Thirteen people used the office from 9:00 to 
18:00. Since the electricity consumption data were obtained for the 
real-life activities of office occupants during each minute, data were 
collected 1440 times daily. 


5. Results 
5.1. Data set 


The experiment was performed using office power consump- 
tion data collected from a smart meter and stored in a database. 
The dataset comprised power consumption data samples obtained 
once per minute during the 17-week period from October 22, 2012 
to October 7, 2013. The forecasting performance of the models was 
assessed by dividing each dataset into training and testing sets. 
The datasets were also rolled every week as new data arrived to 
obtain a sliding window. Therefore, the 8-week and 4-week sliding 


windows were the training dataset, respectively. The testing 
dataset was one week after training. 

The ARIMA calculations for both hybrid models were performed 
in a similar environment, i.e., the R library. The hybrid model was 
performed using NNAR function and ARIMA predictions were 
implemented using auto ARIMA function. The same environment 
was used for both models because the research goal was to develop 
an applicable real-time detection system. Therefore, a suitable 
method identified in this research could later be used in a complete 
system. Predictions of electricity consumption during the following 
week were based on 8 or 4 weeks of rolling training data. 

For example, data for weeks 1-8 were used to predict elec- 
tricity consumption for week 9 while data for weeks 2-9 were 
used to predict consumption for week 10. The same procedure was 
used for 4 weeks of rolling data, i.e., data for weeks 5-8 were used 
to predict consumption for 9, and so on. The data used to build the 
forecasting model for each of the considered weeks included 
electricity consumption for each minute of the 8 weeks and 
4 weeks previous to the first day of the week whose consumption 
are to be predicted. Standard ARIMA predictions based on 8-week 
and 4-week training data were also used for comparison with the 
NNAR method. 


5.2. Training results 


Table 1 shows both the NNAR and ARIMA models accuracy 
based on the training data set. Particularly, the NNAR's MAPE 
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MAE 
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0.006 
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0.006 
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MAPE RMSE 
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Table 1 


values for 8 weeks of training data have a minimum value of 
1.852%, a maximum value of 3.597% and an average value of 
2.684%. The MAPE results for the 8-week NNAR training model 
revealed a satisfactory fit to the data. The 4-week training data had 
a minimum value of 1.252%, a maximum value of 3.752% and an 
average value of 2.490%. The results for the 4-week NNAR training 
model also showed a good data fit. 

Moreover, RMSE evaluation showed that the 8-week model had 
a minimum value of 0.046 kWh, a maximum value of 0.286 kWh, 
and an average value of 0.083 kWh. Similarly, the 4-week model 
had a minimum value of 0.026kWh, a maximum value of 
0.288 kWh, and an average value of 0.088 kWh. The experimental 
results showed small RMSE values for both models. As for MAE, 
the 8-week model had a minimum value of 0.02 kWh, a maximum 
value of 0.04 kWh, and an average value of 0.0276 kWh. The 4- 
week model had a minimum value of 0.006 kWh, a maximum 
value of 0.045 kWh, and an average value of 0.024 kWh. Likewise, 
the results indicated a very good data fit in the 4-week model. 

Further comparisons between the NNAR results and the ARIMA 
results showed that the NNAR results were slightly better for both 
8-week data and 4-week data. All RMSE, MAE and MAPE results 
for the NNAR were smaller than the ARIMA in the 4-week model. 

Specifically, standard ARIMA modeling using 8-week and 4- 
week data both gave MAPE result between 1.252% and 3.752%, 
RMSE between 0.026 kWh and 0.288 kWh and MAE between 
0.006 and 0.045. The modeling results confirmed that the 4- 
week NNAR method had a lower error rate compared to ARIMA. 


5.3. Prediction results 


Table 2 presents the prediction results obtained using both the 
NNAR and ARIMA models for real-time prediction during the 
following week. The MAPE for the 8-week NNAR model had a 
minimum value of 0.72%, a maximum value of 18.39% and an 
average value of 10.69%. The results showed that the prediction 
accuracy fell within acceptable MAPE that is around 10%. Further- 
more, considering random office occupants activities, MAPE con- 
sumption range are all below 20%. Therefore, the prediction results 
showed a satisfactory data fit in the 8-week NNAR model. 
Similarly, 4-week model obtained a minimum value of 0.38%, 
maximum value of 18.64%, and an average value of 10.66%. 
Therefore, the 4-week model also had a very good data fit. 

The RMSE evaluation showed that the predictions obtained by 
the 8-week NNAR model had a minimum value of 0.020 kWh, a 
maximum value of 0.35 kWh and an average value of 0.156 kWh. 
Likewise, prediction results for the 4-week model had a minimum 
value of 0.02 kWh, a maximum value of 0.51 kWh and an average 
value of 0.158 kWh. The maximum value of 8-week NNAR was 
0.35 kWh which occurred on Tuesday as depicted in Table 2. The 
power consumption in the office increased because that is the day 
before weekly meeting. Everyone is preparing for meeting pre- 
sentation and working progress report. Similarly, the maximum 
value of 4-week NNAR is 0.51 kWh which occurred on Wednesday, 
the day of weekly meeting. Furthermore, the values indicate the 
office occupants’ activities increased during those two days. 
In view of the daily consumption of approximating 1 kWh, the 
average difference of 0.156 kWh and 0.158 kWh are considered 
satisfactory. Therefore, the NNAR models based on 4-week and 
8-week test data had a very good data fit. 

Further analysis of the MAE results in the 8-week NNAR model 
showed a minimum value of 0.01 kWh, a maximum value of 
0.18 kWh and an average value of 0.096 kWh, which indicated a 
very good data fit. In the 4-week NNAR model, the MAE results 
had a minimum value of 0.00 kWh, a maximum value of 0.18 kWh 
and an average value of 0.098 kWh. Therefore, the model fit very 
well in both NNAR 4-week and 8-week data streams. 
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Table 2 


As in the training data set, NNAR results were compared with 
ARIMA results in both the 8-week and 4-week models. The test 
results were identical to the training results, which confirmed that the 
performance of the NNAR model was slightly superior. However, the 
indicators are quite deceptive since the main advantage of the NNAR 
model over standard ARIMA is its capability to reveal consumption 
patterns (Fig. 5), which is not possible in standard ARIMA (Fig. 6). 


6. Analytical results and discussions 
6.1. Anomaly detection results 


The anomaly detection procedure applied the two sigma rule 
which classifies any points outside of 2 SD from the mean as 
anomalous data. However, further observation of daily activities 
showed that the occupants occasionally used the printer, which 
instantaneously increased electricity consumption by over 2 SD. 
Therefore, another rule was included so that printing was not 
defined as anomalous activity. Observations also showed that 
printing activity usually lasted less than 5 min. The final rule then 
considers electricity consumption anomalous if the consumption 
exceeds the prediction by 2 SD at least 5 min. Although the rule is 
only applicable in this case, the building manager or tenants can later 
adjust the rule according to their own requirements. Likewise, Fig. 7 
shows examples of anomalous activity, which are indicated by large 
gaps between predicted and actual consumption. 

Table 3 depicts the anomaly detection results obtained by 
NNAR method based on 8-week data and 4-week data. 
The detection results show that NNAR can identify consumption 
patterns and detect anomalies. Furthermore, the method can also 
differentiate normal and anomalous consumption precisely. After 
excluding Monday of week 12, the 8-week and 4-week models 
achieved only 51.25% and 51.32% accuracy, respectively, due to 
incomplete previous historical data. The NNAR can predict power 
consumption with 76.46-99.65% accuracy for the 8-week data 
window (average 89.1-96.5%). For 4-week data window, the 
average accuracy is 86.8-94.72% with normal consumption 
between 76.25% and 100% except for Thursday on week 9. 

However, Table 4, which contains consecutive ARIMA results for 8- 
week data and 4-week data, shows that the ARIMA models could not 
identify consumption patterns. The methods were also unable to 
differentiate between normal and anomalous consumption when 2- 
sigma rule was applied. Thus, Table 4 shows that most incorrectly 
categorized data were classified as anomalous. Moreover, 60.47% of 8- 
week results and 55.81% of 4-week results were categorized as 
anomalous consumption. The result shows that, after 9 weeks, the 
8-week data provided normal consumption values between 0 and 
100% with average values between 33.45% and 38.31% except for 
Thursday, which revealed an average value of 86.31%. The 4-week 
data obtained normal consumption values between 0% and 100% with 
average values between 32.21% and 58.92% except for Thursday, 
which had an average value of 76.2%. 


6.2. Discussions 


The main objective of this study was to develop a method of 
predicting anomalous power consumption in real time. Therefore, the 
method must be able to process large data streams quickly in a real- 
time environment. Predictions were obtained for both 8-week data 
and 4-week data. Standard ARIMA was also compared. The experi- 
ment was motivated by the parsimony principle. That is, for an IBM 
server with a quad-core processor and 64 GB memory, predictions 
based on 4-week data can be obtained in 3 min, which is 2 min faster 
than predictions based on 8-week data. Therefore, using less data 
reduces prediction time, which is suitable for a real-time environment. 
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Fig. 5. Electricity consumption data and prediction results of NNAR on Thursday of week 13 based on 8 weeks of historical data. 


3 T 
\ 
\ 
Historical data Prediction 
\ 
| 
\ 
e 27 
= \ 
% ] 
z ] 
g \ 
i= ] 
a. 
g i 
5 mL 
z} i 
> ] 
$ ] 
E i 
3 \ 
u o4 
] 
] 
] 
] 
\ 
\ 
a4 
0 1 2 3 4 5 


Week 


Fig. 6. Electricity consumption data and prediction result of ARIMA(3,1,0) on Friday 
of week 9 based on 4 weeks of historical data. 


The MAPE results for 9-week electricity consumption prediction 
data showed that 44.18% of predictions based on 8-week data had 
MAPEs below 10% while 37.78% of predictions based on 4-week data 
had MAPEs below 10%, which are considered satisfactory. The 
remainder had values between 10% and 20%, which are considered 
good. Moreover, comparisons of minimum, maximum and average 
values showed that the 4-week models had lower minimum and 
average values but higher maximum values compared to the 8-week 
models. These facts indicate that 4-week models are slightly more 
volatile than 8-week models. Further analysis based on observation 
showed that MAPE values larger than 10% indicate changing con- 
sumption patterns while MAPE values lower than 10% indicate that 
consumption is similar to that in the previous week. 

The average RMSE values for training models based on 8-week 
and 4-week data were 0.084 kWh and 0.088 kWh, respectively, 
which were small enough to compare daily consumption approx- 
imating 1 kWh. Further comparisons of minimum and maximum 
values also showed no significant differences between the models. 
Likewise, the average and minimum predicted values were almost 
identical, but the maximum values obtained by the 4-week model 
were significantly larger than those obtained by the 8-week model 
(0.51 kWh and 0.35 kWh, respectively. The values indicate that the 
4-week models are more error-prone compared to 8-week models. 

Comparison of MAE values in the two training models showed 
that the 4-week models had a lower minimum value and a larger 
maximum value compared to the 8-week models. However, 
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Fig. 7. Electricity consumption on Thursday of week 13. 


average values were similar (0.024 kWh and 0.028 kWh in the 
4-week model and 8-week model, respectively). Moreover, the 
two models obtained similar maximum and minimum values in 
the testing datasets. Also, average values were the same, i.e., 
0.1 kWh for both 4-week and 8-week models. 

Further comparison with standard ARIMA models showed that 4- 
week NNAR models performed better in terms of MAPE, RMSE and 
MAE. Although both NNAR and standard ARIMA produced compar- 
able prediction results, standard ARIMA method did not perform well 
in the anomaly detection stage. Since the ARIMA predictions also 
tended to converge to the mean value, they could not reveal power 
consumption patterns. Consequently, when 2-sigma rule was applied, 
data were categorized as anomalous data. In contrast, the NNAR 
method revealed power consumption patterns. 

Comparisons of 8-week and 4-week NNAR model performance 
using training data showed similar performance in terms of 
average MAPE, RMSE, and MSE values. Further evaluations using 
real-world data also yielded similar performances for both models. 
Although the 4-week models are more volatile and more error- 
prone compared to 8-week models, the difference is small, and the 
4-week models have the advantage of lower computational load. 
Thus, in terms of the trade-off between performance and compu- 
tational load, 4-week models are better than 8-week models. 


7. Conclusions 


The objective of this study was to develop a fast and accurate 
method of real-time anomaly detection. Anomaly detection is 
essential for effective building power demand management. 
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Table 3 
Anomaly detection by NNAR using 8-week and 4-week data. 


Dataset Day Consumption prediction Week 
9 10 
8-Week data stream Monday Normal consumption 98.47 - 
Anomalous consumption 1.53 - 
Tuesday Normal consumption 96.87 94.03 
Anomalous consumption 3.13 5.97 
Wednesday Normal consumption 95.35 95.56 
Anomalous consumption 465 444 
Thursday Normal consumption 97.36 9715 
Anomalous consumption 2.64 2.85 
Friday Normal consumption 99.51 93.12 
Anomalous consumption 049 6.88 
4-Week data stream Monday Normal consumption 99.1 - 
Anomalous consumption 0.9 - 
Tuesday Normal consumption 97.57 941 
Anomalous consumption 243 59 
Wednesday Normal consumption 95.69 95.62 
Anomalous consumption 4.31 4.38 
Thursday Normal consumption 12.65 96.94 
Anomalous consumption 87.35 3.06 
Friday Normal consumption 99.51 92.85 
Anomalous consumption 0.49 715 
Table 4 
Anomaly detection by ARIMA using 8-week and 4-week data. 
Dataset Day Consumption prediction Week 
9 10 
8-Week data stream Monday Normal consumption 20.76 - 
Anomalous consumption 79.24 - 
Tuesday Normal consumption 47.64 25.07 
Anomalous consumption 52.36 74.93 
Wednesday Normal consumption 34.37 26.81 
Anomalous consumption 65.63 73.19 
Thursday Normal consumption 0 100 
Anomalous consumption 100 0 
Friday Normal consumption 5.76 12.92 
Anomalous consumption 94.24 87.08 
4-Week data stream Monday Normal consumption 20.76 - 
Anomalous consumption 79.24 - 
Tuesday Normal consumption 1.81 25.14 
Anomalous consumption 98.19 74.86 
Wednesday Normal consumption 34.37 26.81 
Anomalous consumption 65.63 73.19 
Thursday Normal consumption 0 7.64 
Anomalous consumption 100 92.36 
Friday Normal consumption 99.51 32.99 
Anomalous consumption 049 67.01 


Increased government regulation and new certification schemes 
also require improved energy efficiency in buildings [7]. Likewise, 
anomalous power consumption indicates inefficient building 
power consumption. Therefore, inefficiencies can be quickly 
recognized so that the building manager and/or tenants can take 
appropriate action. 

Several researchers have proposed real-time anomaly detection 
methods. However, methods proposed so far require huge training 
data sets, produce static models [33] and have unsatisfactory 
accuracy [34]. Therefore, this study proposes a combination of 
prediction and two-sigma rule for detecting anomalous pattern in 
real time. The proposed prediction model uses a hybrid method of 
univariate time series ARIMA and ANN based on per-minute 
electricity consumption in an office space. 

The prediction accuracy of 4-week and 8-week NNAR models 
were evaluated and compared to standard ARIMA. The comparison 
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Average (%) 


11 12 13 14 15 16 17 
- 51.25 98.82 9514 99.65 87.57 92.78 89.1 
- 48.75 1.18 4.86 0.35 12.43 7.22 10.9 
98.06 86.18 90.14 99.58 98.4 93.61 91.94 94.31 
1.94 13.82 9.86 0.42 1.6 6.39 8.06 5.69 
91.11 99.03 97.57 96.04 89.79 95.28 87.57 94.14 
8.89 0.97 2.43 3.96 10.21 472 12.43 5.86 
98.96 98.61 96.11 97.85 99.65 98.96 83.82 96.5 
1.04 1.39 3.89 2.15 0.35 1.04 1618 3.5 
9146 99.65 94.86 97.08 99.03 7646 9493 94.01 
8.54 0.35 5.14 2.92 0.97 23.54 5.07 5.99 
- 51.32 98.89 95.83 99.65 99.17 93.26 91.03 
- 48.68 1.11 417 0.35 0.83 6.74 8.97 
97.57 85.42 90 100 91.81 93.47 8465 92.73 
2.43 1458 10 0 8.19 6.53 15.35 7.27 
97.99 99.03 97.29 96.04 8868 9486 87.29 94.72 
2.01 0.97 2.71 3.96 11.32 5.14 12.71 5.28 
98.89 98.54 95.97 96.04 99.72 99.17 83.26 868 
111 1.46 4.03 3.96 0.28 0.83 16.74 13.2 
91.18 99.58 94.86 97.22 98.61 76.25 95.28 93.93 
8.82 0.42 5.14 2.78 1.39 23.75 4.72 6.07 
Average (%) 
11 12 13 14 15 16 17 
- 30.28 2486 12.36 27.85 56.04 72.01 34.88 
- 69.72 7514 87.64 7215 43.96 27.99 65.12 
14.1 19.24 9.79 30.28 67.71 52.64 78.33 38.31 
85.9 80.76 90.21 69.72 32.29 47.36 21.67 61.69 
15.28 12.22 25.35 41.53 1319 62.43 89.72 35.66 
84.72 87.78 74.65 58.47 86.81 37.57 10.28 64.34 
100 98.61 97.71 9417 89.79 96.53 99.93 863 
0 1.39 2.29 5.83 10.21 3.47 0.07 13.7 
58.89 42.08 36.94 32.64 6.87 493 100 33.45 
41.11 57.92 63.06 67.36 93.13 95.07 0 66.55 
- 67.43 24.86 12.36 27.85 56.04 72.01 4019 
- 32.57 7514 87.64 7215 43.96 27.99 59.81 
141 20.69 9.79 30.28 67.71 52.64 67.71 32.21 
85.9 79.31 90.21 69.72 32.29 47.36 32.29 67.79 
15.28 12.22 2535 41.53 13.54 62.43 89.72 35.69 
84.72 87.78 7465 58.47 8646 37.57 10.28 64.31 
100 100 97.71 9417 89.79 96.53 99.93 76.2 
0 0 2.29 5.83 10.21 3.47 0.07 23.8 
31.04 42.08 9486 32.64 6.87 95 95.28 58.92 
68.96 57.92 5.14 67.36 93.13 5 472 41.08 


results confirm that the hybrid NNAR method obtains more 
accurate predictions of electricity consumption compared to 
standard ARIMA. Moreover, the models based on 4 weeks and 
8 weeks of rolling training data performed similarly. Specifically, 
average real-time predictions were between 89.1% and 96.5% for 
the 8-week data window, and between 86.8% and 94.72% for the 4- 
week data window. 

Furthermore, the model for predicting daily consumption during 
the following week could be developed in only 3 min using the 
previous 4-week consumption dataset. Therefore, anomalies could be 
detected in real time using a simple and quick calculation of 
differences between real-time and predicted consumption data. 
Consequently, the approach is suitable for use in a real-time environ- 
ment and large data environment. Finally, considering the parsimony 
principle and trade-off between performance and computational load, 
4 weeks of training data are adequate for achieving good results. 
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The research contributes to the formalization of a methodology 
for real-time detection of anomalous patterns in large data sets. 
Particularly, as the method has two stages, the prediction part 
helps building managers plan their energy consumption while the 
anomaly detection part helps building managers to identify 
unusual consumption of electricity by tenants. Moreover, the 
tenants can understand how their business activities consume 
energy through their anomalous energy consumption. 

As power consumption feedback is essential for maximizing 
energy savings, implementing optimization algorithms and adding 
explanatory variables into a building management system are the 
next stage of this research. Two other research directions are 
possible. Firstly, as time-based pricing policy becomes widely 
implemented in many countries, providing end users in residential 
buildings with feedback by suggesting operation time scheduling 
options for minimizing energy consumed by appliances is essen- 
tial. Secondly, implementing smart meters and sensors will rapidly 
accumulate electricity consumption data. Therefore, big data 
infrastructure must be applied to accommodate and analyze 
unstructured data resulted from the smart grids. 
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